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Pipeline  techniques  have  been  successfully  applied  to  speeding  up  proc¬ 
essing  in  both  general  and  special  purpose  digital  computers.  Application  of 
these  techniques  to  non-recursive  (FIR)  filters  has  been  suggested  and  is  quite 
straightforward.  Application  to  recursive^(TlRT' filters  has  not  previously 
been  shown.  In  this  paper,  the  technique  for  applying  pipeline  techniques  to 
recursive  filters  is  shown  and  the  advantages  and  disadvantages  of  the  tech¬ 
nique  are  discussed.  Using  these  techniques,  recursive  digital  filters  oper¬ 
ating  at  hitherto  impossibly  high  rates  can  be  designed. 
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INTRODUCTION 


Digital  filters  have  an  important  place  in  the  technology  of  processing 
signals.  Because  of  the  sample  theorem,  the  sampling  rate  which  is  also  the 
clock  rate  of  the  filter,  must  be  higher  than  twice  the  highest  frequency 
component  in  the  signal  to  be  filtered.  Thus  the  use  of  digital  filters  in 
real  time  application  involving  high  frequencies  demands  high  sample  rate. 

Pipeline  techniques  are  well  known  [1,2]  v/ays  to  increase  the  clock  rate 
(throughput)  of  a  particular  state-of-the-art  realization  of  a  logic  module. 
For  example,  the  Cray  I  supercomputer  has  a  clock  rate  of  80  Mhz  and  requires  a 
7  stage  pipeline  multiplier  to  perform  64~bit  floating  point  multiplication  at 
the  rate  of  one  product  every  12.5  ns.  [3]. 

Other  workers  have  investigated  residue  number  techniques  to  obtain  high 
rate  digital  filters  [4].  However,  difficulties  in  implementing  division  by  a 
constant  impose  limitations  on  such  realizations,  making  scaling  awkward  and 
precluding  anything  resembling  floating  point. 

Pipeline  techniques  can  thus  be  used  to  produce  multipliers  and  adders 
that  operate  at  the  maximum  clock  rate  possible  for  a  given  state  of  the  art  of 
logic  devices.  For  example,  TRW  makes  a  non-pipeline  16  bit  integer  multiply 
chip  with  input  and  output  registers  that  can  be  operated  at  the  rate  of  about 
9  MHz.  Pipeline  techniques  could  be  applied  to  that  design,  producing  a  new 
chip  that  miglit  have  three  stages  and  operate  at  near  40  MHz. 

Discrete  logic  realization  of  the  basic  functions  of  multiplication  and 
division  can  yield  even  better  performance.  At  the  extreme  we  have  some  of  the 
integrated  circuit  ECL  1 ines.  Based  on  the  experience  gained  with  the  pipel ine 
supercomputers,  it  is  reasonable  to  expect  that  a  three  stage  12-bit  integer 
multiply  and  1  to  2  stage  16-bit  integer  adder  could  be  built  to  operate  at 
around  80  MHz.  Floating  point  operation  could  be  obtained  at  the  same  rate 
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with  a  cost  of  perhaps  only  2  additional  stages  for  the  multiplier  and  3  more 
for  the  adder. 

What  finally  limits  *’he  clock  rate  in  a  pipeline  system  is  the  net  delay 
of  all  logic  and  the  register  delay  between  successive  registers,  however 
pipeline  techniques  will  yield  the  highest  throughput  o»*  clock  rate  possible 
with  a  given  state  of  the  art. 

Now  it  should  be  clear  v*iy  we  should  consider  the  application  of  pipeline 
techniques. 

1)  higher  clock  (sample)  rates  are  possible  for  a  given  state  of  the 

art. 

2)  more  complex  operations  (i.e.  floating  point)  are  possible  for  a  given 
state  of  the  art  and  clock  rate. 

Pipeline  techniques  can  be  easily  applied  to  non-recursive  systems,  that 
is,  systems  without  feedback.  In  the  digital  filter  arena,  this  would 
correspond  to  ncn-recursive  or  Finite  Impulse  Response  (FIR)  filters.  These 
techniques  are  straight  forward  and  proceed  as  follows: 

An  FIR  filter  can  be  described  by  a  transfer  function  in  the  delay 
operator  (2”1  or  0)  as  in  (1) 

I  -  H(z'M  *  *0  *  ®1  *m 

Figure  1  shows  the  realization  of  such  a  filter  using  delayless  add  and 
multiply  modules  as  vfell  as  unit  delays.  That  realization  is  unrealistic  and 
does  not  capitalize  on  the  advantages  of  pipeline  procersors  as  discussed 
above. 

Let  us  suppose  that  we  have  a  1-stage  pipel  ine  adder  and  a  3-stage 
pipe! ine  multipl  ier. 
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Figure  2  shows  a  pipeline  realization  of  the  transfer  function  given  in 
(1).  In  fact,  (1)  is  not  exactly  realized  because  of  the  delay  of  the  adders. 
Instead  the  output  of  the  filter  is  y  delayed  by  7  clock  periods.  This, 
however,  is  not  a  serious  difficulty. 

Thus,  pipeline  realization  of  FIR  filters  is  straight  forward,  costing 
only  an  added  delay  in  the  resulting  output  sequence  rate,  and  perhaps  some 
adoiticnal  adders  over  the  realization  of  figure  1,  if  the  adder  has  more  than 
one  stage. 

What  Chen  about  doinn  this  for  HR  filters?  Applying  pipeline  techniques 
to  recursive  calculations  is  not  so  straightforward  as  it  is  in  the  non¬ 
recursive  case,  and  was  first  reported  in  [1]  in  connection  with  accumulator 
design. 

In  the  next  section  we  shall  consider  the  design  of  recursive  (HR) 
filters.  Following  that,  we  discuss  the  general  attributes  of  the  solution  and 
some  of  the  unsolved  problems  associated  with  the  technique.  Finally,  an 
example  of  a  sixth  order  Butterworth  filter  is  treated,  to  illustrate  the 
technique. 


PIPELINE  RECURSIVE  DESIGN 


An  order  recursive  or  Infinite  Impulse  Response  (HR)  filter  can 
be  characterized  by  a  ratio  of  two  nth  degree  polynomials  In  z”l,  the  delay 
operator,  as  shown  In  (2) 


ag  +  ajz 


■1 


* 


•m 


b^z  •“b2Z  “ 


X  » 


A(z) 

"BTzT 


(2) 


This  representation  can  be  expressed  using  0,  another  notation  for  the 
delay  operator,  as  shown  in  (3).  Time  Is  measured  In  Integer  steps  by  the 
index  1. 


y(i) 


A(D) 

mj 


x(1) 


(jj  ♦  ajD  *  ...  ♦  a^o")  .^(t) 
~  (1  -  bjD  -  ...  -  b/ 


where 


(3) 


d''x(1)  -  x(1-k) 

Rearranging  (3)  yields  the  familiar  recursive  formulation  of  the  filter 
equation,  as  shown  In  (4) 

y(1)  -  (ag  +  a,D  +  ...  +  a^D"*)  x(1)  +  (b^D  +  ...  +  b^)  y(1)  (4) 

Equation  (4)  leads  directly  to  the  canonical  representation  of  the  filter  as 
shown  In  figure  3,  assuming  the  availability  of  a  multiply-add  unit  with  unit 
delay  (one  clock  pulse  period  delay).  This  real  izatlon  actually  does  not  yield 
y,  but  in  fact  produces  Dy,  y  delayed  by  one  clock  pulse  period.  The  canonical 
form  shown  In  this  figure  can  be  realized,  but  unfortunately  because  the 
mrltiply-add  unit  Is  very  complex,  the  realization  will  be  quite  slow.  If  we 
assume  12  bits  of  significance  to  the  representation  of  the  numbers  In  the 
filter,  we  find  that  the  unit  has  two  l-1nput,  12-b1t  constant  multipliers 
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finding  a  three-input  by  12-bit  adder.  All  of  this  complexity  forces  the  clock 
pulse  period  to  be  very  long,  and  the  clock  or  sampling  rate  as  a  consequence 
to  be  low.  A  typical  value  or  clock  rate  using  state-of-the-art  Schottkey 
integrateo  circuit  PROMs  and  adders  would  be  on  the  order  of  10  MHz.  A  slight 
improvement  in  the  rate  of  operation  is  possible  if  we  use  the  basic  module 
shc'.vn  in  figure  4a.  A  delay  of  1  clock  pulse  period  is  produced  by  the  output 
register.  This  module  could  be  built  to  operate  at  a  clock  rate  of  perhaps  12 
MHz,  compared  with  the  real ization  of  figure  3. 

What  is  desired  is  some  means  to  take  advantage  of  higher  clock  rate  pipe- 
1  ine  realiza^’ion  of  the  basic  multiply-add  module,  thus  producing  a  higher  sam¬ 
pling  rate  realization  of  the  recursive  digital  filter. 

In  order  to  increase  the  clock  rate  of  the  multiply-add  unit,  we  first  use 
the  two  input  version,  and  then  insert  several  registers  for  intermediate  re¬ 
sults  in  the  coefficient  multiply  and  adder  sections.  For  example,,  a  2-stage 
version  is  shown  in  Figure  4b  that  could  be  made  to  operate  at  16  MHz.  The  net 
result  is  a  +  k^-stage  (register)  pipeline  multiply-add  unit  as  shown  in 
figure  5,  with  k^,  stages  involved  in  the  coefficient  multiply  and  kg  stages 
in  the  adder. 

When  we  attempt  to  use  the  module  of  figures  4  or  5,  we  find  it  impossible 
to  insert  it  in  the  feedback  loop,  preserving  a  minimum  delay  of  only  one.  So, 
if  we  are  to  use  the  pipeline  module,  some  other  way  must  be  found. 

For  the  purposes  of  the  derivation  which  follows,  we  will  restrict  our 
consideration  to  second-order  filters,  that  is  m»2,  n-2.  This  in  no  way  limits 
the  result,  for  the  process  used  is  not  dependent  on  n,  except  for  the  value  of 
the  individual  coefficients.  On  the  other  hand,  higher  order  filters  can  also 
be  realized  as  the  cascade  of  fn/il  2'’*^  order  filters,  and  at  most  one 
filter  of  lower  order. 
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Now  we  let  n*2  and  consider  that  specific  case  of 
Y  *<'■'>  *0  ♦  ‘ 

T  ■  - ITT”  ■  - . '—.X . .  '5) 

*  B(z  M  1  -  bjZ  ^  -  bgZ  ^ 

or  represented  ’.n  terms  of  the  jlay  operator  D,  the  usual  computer 
representation  of  delay. 

2 

a^  +  a,0  +  a,D 

J - \ - 2 

1  -  bjO  -  b20^ 

Writing  this  In  a  different  form  wo  have 

y(l)  -  (ag  +  ajD  +  a202)x(1)  +  (bjD  +  b2D2)y{l)  (7) 

which  can  also  be  written  In  difference  equation  form: 

yi  -  ajjX^  +  ajx^_j  +  a2X^^2  *  (*^1^1-1  *  Vl-2^ 

For  subsequent  development  we  will  use  (7);  furthermore,  we  will  omit  the  (1) 
notation.  We  will  relate  that  form  to  the  other  forms  where  appropriate. 

Let  us  first  Ignore  the  A(D)  polynomial  by  substituting 

x'  ■  A(D)x  (9) 

Into  (7)  yielding 

y  -  x'  +  (bjOy  +  b2D2y)  (10) 

Delaying  (10)  and  substituting  for  Dy  In  (9)  we  obtain 
y  -  x*  +  bi(Dx'  +  bjD^y  +  bgD^y)  +  b2D2y 
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or 


y  -  (1  +  bjO)x'  V  (bj  +  b2)D^y  +  b^bgO^y  (11) 

(11)  Is  now  a  third  order  dlficrence  ;>qu?tioii  describing  the  same  digital  fil¬ 
ter.  That  is  to  say,  the  filter  described  by  (11)  has  the  same  transfer  char¬ 
acteristic  as  the  one  described  by  (7)  or  (10).  Delaying  (10)  by  0^  and 
substituting  into  (11)  yields 

y  »  [1  +  bjO  +  (bj  +  b2)D^]x' 

+  (2bjb2  +  bj)0^y  +  (b^  +  (12) 

In  general,  successively  more  delayed  versions  of  (10)  can  be  substituted, 
raising  the  order  of  the  difference  equation,  but,  more  importantly,  raising 
also  the  minimum  delay  associated  with  the  feedback  y  terms.  The  general 
higher  order  difference  equation  has  the  following  general  form,  provided  we 
started  from  a  second-order  equation; 

y  »  Cl  0  +  0^  >  ...  +  dP]x' 

+  bJPjoP^V  +  bJP^DP^^y  (13) 

In  fact,  if  we  began  with  an  n^^  order  difference  equation,  we  would  have 
an  equation  of  p^-n  order  resulting  from  the  foregoing  process  as  shown  in  (14) 

y  -  [1  +  aJP'  0  +  ...  +  a^P^  0p]x’ 

+  b^JoP^V  +  ...  +  bJPy^"y  (14) 

The  coefficients  ajP^  and  b|P^  can  be  calculated  by  following  the  process 
described  above  in  detail.  For  example,  it  should  be  clear  from  (12)  that 
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,(2) 

1 

-  h  •  ,(")  - 

-  Dj  , 

“  •>2 

,(2) 

3 

=  2b^b2  +  bj 

;  bj^^  =•  bg  +  b?b2 

(15) 

v,h:.r3  the  orl^jinai  b  coefficients  are  from  (10).  We  will  cover  the  process  In 
detail  a  little  later  in  this  section. 

flow  let  us  recall  vdiat  x'  is  in  terms  of  the  original  IIR  filter  as  des¬ 
cribed  in  (7).  Substituting  x'  =»  A(D)x  into  (13)  yields: 

y  =  [1  +  n  +  ...  +  nP]  [ag  +  a^D  +  a2D^]x 

+  (16) 

This  equation  v^hon  multiplied  out  for  the  case  of  p=2  yields  (17) 

y  =>  +  ap)o  +  ...  +  a|^)D^]x 

+  oS  *■  bf  y]  (17a) 


w'  =  [a^^)  ap)D  +  ...  +  a,^^)D'*]x 

+  [b^^)D^  w*  +  w'3 


(l«a) 


These  tw  equations  look  very  similar  now,  the  only  difference  being  that 
the  non-recursive  parts  (18a)  is  the  non-recursive  part  of  (I7a)  delayed  by 
a  +  4.  If  both  sides  of  (17a)  are  delayed  by  that  amount,  (17b)  results: 


=  0^^^  [a(2)  .  a(2)o  .  ...  .  aJ'^D^lx 
+  [b(^)D^‘^^y  +  b(^)D^'*’®y] 


(17b) 
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Finally,  It  should  be  noted  that  In  (17b)  Is  the  same  as  w'  In 

(18a).  substituting  D^^^y  »  w'  In  (17b)  produces  (17c): 

v^'  a  [a^^^  +  a|^^  +,..•»■  a|^^0^3x 

+  Cb^^^O^  w'  +  w*]  (17c) 

which  Is  the  sane  as  (18a). 

VIhat  this  all  means  Is  that  the  structure  of  figure  6  will  produce  an  out¬ 
put  w'  which  Is  simply  the  desired  signal  y,  delayed  by  4+4  sampling  periods. 
This  digital  filter  structure  uses  pipeline  logic  units  in  both  the  recursive 
and  non-recursIve  portions  to  realize  the  desired  output. 

The  design  stHl  needs  to  be  completed,  even  though  the  heart  of  the 
structure  has  |)een  developed.  Figure  7  shows  the  structure  of  the  ncn- 
recurslve  portion  of  the  filter  to  complete  the  example. 

The  circuit  of  figure  7  produces  D^Cag  +  aiO  +..,+  a*D^3x  which 
corresponds  to'  the  +  axO  +...+  aaO^^x  term  re(^1red  as  the  out¬ 

put  of  the  non|- recursive  part  of  figure  7.  Therefore,  the  pipeline  digital 
filter  of  figure  7  combined  with  figure  6  produces  0^'^y,  that  Is  y,  de¬ 
layed  by  8  sample  periods. 

Now  we  j  return  to  the  calculation  of  the  and  bjP^ 

coefficients.  More  general  relations  than  (15)  can  be  developed  In  difference 
equation  form  with  respect  to  p. 

Starting  with  the  p  augmented  order  (p+2  order)  difference  equation  (13) 
we  can  derive  the  p+1  augmented  order  equation  by  substituting  for 
C(IO)  delayed  by  D**"*^^]  Into  (13). 
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as  shown  1n  (16)  and  (18).  Table  2  shows  the  values  of  a^^^  for  p 
from  0  through  5. 

The  unique  structure  of  this  realization,  which  differs  from  conventional 
realizations  of  digital  filters  Is  contained  in  the  recursive  portion,  where 
the  basic  feedback  loop  Involving  the  longest  delay  passes  through  p4-2  pipeline 
stages,  for  the  realization  of  a  2^^  order  filter.  In  the  example  devel¬ 
oped,  p  was  2,  causing  us  to  use  a  order  pipeline  filter  represen¬ 
tation  of  the  original  2”*^  order  filter. 

In  general,  we  assume  we  have  k^  stage  pipeline  add  units  and  k„  stage 
pipeline  multipliers.  The  structures  of  the  rallzatlon  of  the  feedback  portion 
of  the  filter  would  be  as  shown  In  figure  8.  Since  the  maximum  delay  around 
the  loop  must  match  the  p  augmented  difference  equation  order,  we  have 

p-2k3  +  k^-2  (21) 

The  FIR  portion  of  the  filter  Is  shown  In  general  terms  In  figure  14. 
From  this  figure.  It  should  be  clear  that 

»  -  fiogjCk.iJk,  ♦  k,  ♦ 

and  that  the  over  all  delay  Is  therefore 

'  » +  2k,  .  fl09j(k,)]k^.3kj*k„  (ZZ) 

STABILITY  ISSUES 

In  the  previous  section  we  have  seen  the  general  orocess  of  realization  of 
a  digital  filter  transfer  function  using  pipel Inemultiply-add  units  with  delay 
2.  We  have  also  shown  a  realization  using  2-stage  multiply-add  units,  inu 
essence  of  the  techlque  Is  to  represent  a  second-order  section  by  means  of  a 
(2+p)^^  order  section  of  a  particular  form,  where  p  Is  determined  by  the 
number  of  stages  In  the  pipel Ine  nwltlply  and  add  units. 


We  know  from  other  considerations  that  the  transfer  function  of  the  higher 
order  real  Izatlon  roust  in  fact  be  the  same  as  the  original  and  therefore  must 
have  the  same  poles  and  zeros  in  the  z  plane.  Thus,  the  operation  »^*i1ch  trans¬ 
forms  (7)  Into  (17)  can  be  thought  of  In  the  following  way: 


(1 


+  ajz"^  +  agz”^)  (1  +  ojP'z”^  ^ 

-  bjz"-*  -  b2Z^)  (1  +  a^P^z'^  +  ...  + 


Jp)  +  ,(P)  ,-l  +  +  JP)  ,-P-^ 

ag  +  a^  z  +  ...  +  ap^2  * 

p+1  p+2 


(23) 


(24) 


Where  the  coefficients  of  the  a  polynomial  depend  on  the  original  bj  and 
b2  of  (5)  and  (7).  These  o  coefficients  are  governed  by  (201  and  some  were 
tabulated  in  Table  lb.  Thus,  we  .raise  the  order  of  the  denominator,  Introduc- 

i 

ing  p  new  poles  and  corresponding  cancel liiig  zeros.  These  new  poles  correspond 
to  the  roots  of  | 

0  •  1  D  4-  ...  dP  (25) 

j 

The  roots  of  a(0)  of  course  are  the  poles  in  the  z*^  plane  and  are 
the  reciprocal  of  the  poles  In  z? plane. 

One  concern  in  the  realization  is  that  the  filter  be  stable,  that  is,  that 

) 

it  have  an  Impulse  response  which  decays  to  zero,  A  sufficent  condition  for 
the  stability  of  a  digital  filter  with  transfer  function  Hfz”^)  is  that 
the  poles  of  H(z’^)  in  the  z”^  plane  lie  outside  the  unit  circle, 
and  that  the  order  of  the  numerator  (-m)  be  less  than  or  equal  In  magnitude  to 
the  order  (-n)  of  the  denominator.  A  more  familiar  sufficient  condition  is 
that  the  poles  of  the  transfer  function  of  z  (H(z))  be  inside  the  unit  circle. 
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Now,  if  we  start  with  a  stable  filter,  we  would  like  tj  be  assured  that 
the  poles  of  the  augmented  order  filter  all  be  outside  the  wit  circle  in  the 
plane*  We  desire  this  because,  even  though  the  added  poles  are 
cancelled  by  the  zeros  of  (1  +  +  ...  +  realization  imperfec¬ 

tions  will  prevent  exact  cancellation  and  the  augmented  filter  would  be 
unstable. 

In  order  to  examine  the  stability  question,  we  will  make  use  of  Jury's 
stability  test  [6]  as  applied  to  the  successively  higher  degree  denominator 
polynomials  In  z  as  p  Increases. 

We  start  with  the  denominator  of  (5),  written  as  a  polynomial  in  z 


P(z)  ■  z^  -  bjz  -  b2 


(26) 


To  Insure  that  the  original  poles  of  (5)  are  Inside  the  unit  circle  and 
hence  that  the  original  filter  Is  stable,  we  apply  Jury's  test  with  necessary 
conditions: 


P(l)  -  1  -  bj  -  b2  >  0 
(-1)'’P{-1)  -  1  +  bj  -  b2 
and  sufficient  condition; 


>  0 


I  b,  I  <  1 


(27) 


Hence^  for  this  second-order  digital  filter  to  be  stable,  the  values  of  bi 

1 

and  b2  must  be  In  the  shaded  area  as  shown  In  Fig.  10.  ftote  that  this  is  the 

case  when  p  >  0,  i.e.,  no  augmentation.  | 

1 

Now,  assuming  that  the  original  flljter  transfer  function  (5)  is  stable, 
i.e.,  (27)  is  satisfied,  it  is  desirabll  to  determine  conditions  to  assure 
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stability  of  the  new  p  poles  introduced  in  the  system  when  the  transfer 
function  is  p-augmented.  For  p  »  1,  the  new  polynomial  introduced  in  the 
numerator  and  denominator  is 

F(z)  -  2  +  bj  (28) 


Applying  Jury's  test,  conditions  for  stability  are 
F{1)  -  1  +  >  0 

(-1)"F(-1)  «  1  -  bj  >  0 


(29) 


This  condition,  |  bj  ]  <  1,  is  shown  graphically  in  figure  11.  Similarly,  for 
p  ■  2,  the  added  polynomial  is 


F(z)  -  2^  +  bj2  +  (bj  +  b2)  (30) 

Jury's  test  gives  conditions  for  stable  roots.  The  necessary  conditions  are 
F(l)  -  1  +  bj  +  (bj  ♦  b2)  >  0  \ 

(.1)'’F(-1)  -  1  -  bj  +  (b^  ♦  b2)  >  0  /  (31) 


and  the  sufficient  condition  is 


I  bf .  bj  I  <  1 


The  region  defined  by  conditions  (31)  is  shown  graphically  in  figure  12. 
Finally,  for  p  ■  3 

F(z)  -  2^  +  bj2^  +  (b^  +  b2)2  +  (bj  +  2bjb2) 
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and  the  conditions  for  stable  poles  are  (figure  13): 

F(l)  -  1  +  bj  +  (bj  +  b2)  +  (bj  +  abjbg)  >  0 
(-1)"f(-1)  -  1  -  bj  +  (bj  +  bg)  -  (bj  +  Zbjbg)  >  0 

I  bj  +  Zbjbg  I  <  1 

I  (b^  +  Zbjb,)^  -  1  I  >  I  bi(bi  +  2bjb2)  -  (b^  +  bg)  | 

This  procedure  can  be  continued  for  higher  values  of  p;  In  fact  general 
Stability  tests  based  on  Jury's  test  have  been  developed  [63.  Unfortunately, 
those  tests  are  cunjbersome  to  apply. 

Instead,  let  us  take  another  approach. 

From  figures  10-13  It  may  be  observed  that  as  the  value  of  p  Increases, 
the  area  of  stability  (the  shaded  areas  In  the  figures)  tends  to  Increase.  It 
was  seen  graphically  that  as  p  Increased,  the  stable  region  came  to  be  closer 
and  closer  to  the  original  shaded  area  for  p  -  0,  I.e.,  figure  10,  This  leads 
to  the  obvious  conjecture  that  all  stable  filters  have  a  stable  augmented 
filter  for  some  large  enough  p. 

We  will  follow  the  procedure  suggested  by  Voelcker  [73  for  Block  Filters 
to  show  that  for  a  large  value  of  augmentation,  p,  the  pipelined  version  of  the 
original  filter  will  always  be  stable,  provided  that  the  original  second  order 
filter  Is  stable.  This  Is  a  very  important  and  useful  conclusion  and  Implies 
that  an  augmented  stable  realization  is  always  possible  If  the  original  trans¬ 
fer  function  Is  stable.  The  proof  for  this  fact  follows: 

Since*  the  process  of  augmentation  Implies  multiplying  the  numerator  and 
the  denominator  by  tiie  same  polynomial,  the  original  transfer  function  may  be 
equated  to  the  augmented  transfer  function  as 


•  •  • 


1  -  bjZ 


H(z) 

r" 


N(z)(4'’'  +  +  ...  + 


.here  H(i)  H  the  numeretor  of  the  orlgtral  transfer  function.  Oenotlng  the 
denominator  of  the  original  transfer  function  as  D(z),  since  this  originai  fil¬ 
ter  is  stable,  the  roots  of  D{z)  are  inside  the  unit  circle.  Figure  H( a) 
sho«  the  original  filter  where  Hp(t)  is  a  pole  only  function  representin, 

the  denominator.  (l/D(z))  of  the  original  filter. 

The  process  of  augmenting  the  difference  equation  suggests  building  the 
filter  as  shown  In  figure  14(b)  where 


and 

Thus 


.(t!  >  4’’*  *  •••  *  4''*'''' 

a(z)  -  b^l  t  b^|a-‘ 

T- 


(33). 


Hence,  the  nonrecursIve  transfer  function  a{2)  may  be  written  as 

o(z)  -  Hp{2)  -  §11} 

Since  H  Is  a  pole  only  transfer  function 


(34) 


Hp(z) 


1 


“7 

n 

1-1 


7 

n  1-4 


2 

•L 


1-1 1  - 
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(•* 


(35) 


Thus, 


Hp(2) 


Comparing  (34)  and  (35)  and,  since  li'p(z)  a  finite  impulse  response 
transfer  function,  equating  1i|j(2)  and  a(z) 


2  a  2 
G(2)  _  ^  *i^i 

unr  2^  (—“Tz 

i-1  ’ 


'  Thus 


The  intent  of  this  procedure  is  to  prove  that,  for  some  high  value  of  p,  all 
roots  of  0(2)  are  inside  the  unit  circle  in  the  2-domain,  From  (33) 

0(2)a(2)  -  1  -  2“^P*^^6(2)  (37) 

Since  0(2)  is  the  denominator  of  the  original  transfer  function,  all  roots  of 
0(2)  are  inside  the  unit  circle.  Hence  all  roots  of  0(2)  are  also  inside  if, 
and  only  if,  all  roots  of  the  right  hand  expression  of  (37)  are  inside  the  unit 
circle.  To  do  this,  let  tilde  (H  denote  the  result  of  the  mapping 
2”^-*  2,  i.e.,  ?*(2)  ■  f(2“^)  and  ^(2)  is  defined  as 

T(2)  “  1  -  2^P*^^^(2)  »tf(z)a(2)  (38) 

All  roots  of  0(2)  are  exterior  (outside  the  unit  circle).  Hence,  all  roots  of 
a(2)  are  exterior  if  all  roots  of  A(2)  are  exterior.  To  prove  this,  Rouche's 
theorem  is  used  which  states  that:  "If  f(2)  and  g(2)  are  analytic  inside  and 
on  a  closed  contour  C,  and  |  g(2)  j  <  J  f(2)  |  on  C,  then  f(2)  and  f(2)  *  g(2)  have 
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the  same  number  of  zeroes  inside  C."  Clearly,  A  in  [Z^)  is  analytic  within  and 
on  the  unit  circle.  The  constant  "1"  has  no  interior  zeroes.  Hence  A(z)  wil’ 
have  no  interior  zeroes  if,  on  the  unit  circle. 


I  I  <  I  .  z  . 

From  (4.20), 


z=e' 


4  3 


i=l  ^  " 


z.e' 


j£/ 


(39) 


(40) 


But  I  Zi  I  <  1  for  all  i  because  these  are  poles  of  the  original  filter.  Thus, 
for  some  value  of  p  greater  than  some  critical  value,  equation  (40)  will  satis¬ 
fied,  Note  that  (40)  may  be  satisfied  for  smaller  value  of  p  also. 

From  the  above  discussion,  it  may  be  concluded  that  for  some  high  value  of 
augmentation,  it  is  always  possible  to  ensure  that  a(z)  has  roots  inside  the 
unit  circle.  Since  these  roots  are  the  new  additional  poles  of  the  augmented 
system,  the  new  high  order  transfer  function  would  be  stable. 

As  a  result  of  this  fact,  the  following  design  method  is  suggested. 

We  assume  that  the  designer  is  given  a  stable  recursive  digital  filter 
transfer  function  H(z)  and  a  desired  sampling  (clock)  rate  of  operation,  fj. 

1.  Find  stage  pipenn..  multiply  and  kg  stage  pipeline  add  units 
that  will  operate  at  clock  frequency  fj. 

2.  Factor  H(z)  into  t  (t  -  1  if  n  is  odd)  2^*^  order  (and  if  n  is 

s  t 

odd,  at  most  one  1  order)  transfer  functions. 

H(z)  =  H,(z)  .  H2(z)  ...lyz) 

where  i  =•  rn/2'I 

3.  For  each  H,-(z),  i  =  1,  2,  ...  i.  Realize  H-j(z)  as  follows: 
a.  Set  p  »  2kg  +  k;n  -  2, 
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I 


b.  Oetermfne  the  polynomial  1  -  ■  Pp(z”^)t 

c.  Solve  for  the  roots  of  Bp(z“^)  -0  (41) 

d.  If  all  roots  of  ■  0  are  outside  the  unit  clr- 

P 

cle,  go  to  e,  else  set  p  ■  p  ♦  1  and  return  to  step  3b. 

e.  H^(2)  can  be  realized  by  a  stable  p  augmented  order  difference 

equat Ion 


+  >  +  a^P^r'P'^ 

+  a^  z  >  ...  +  a  J,z 


,(P) 

*0  -1  - 

1  .  b^P^z'P’^  -  b^P^z’P"^ 
^  Vl  P+2 


(24) 


where  p  2  2k3  +  k,„  -  2 

Because  the  last  result*  we  know  that  we  can  always  find  a  large 
enough  p  so  that  (24)  will  be  stable. 

In  the  next  section,  we  will  Illustrate  the  method  with  an  example  filter 
taken  from  the  1 Iterature. 


EXAMPLE 


We  will  treat  the  Butterworth  filter  considered  by  Oppenhelm  and  Shafer 
[8].  This  Is  a  sixth  order  filter  whose  original  transfer  function  H(s)  Is 
given  by 


H(s) 


_  0.20238 _ _ _ 

( s^+0 . 396S+0. 5871 ) ( s^+1 . 083S+0 . 5871 ) ( s^+1 . 4802s+0 . 5871 ) 


(42) 


Applying  the  b1-l Inear  transformation,  a  sixth  order  difference  equation  Is 
obtained  that  can  be  factored  In  three  second  order  filters. 


H(z)  -  Hj(z)  •  H2(z)  •  H3(z) 


(43) 
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where 


u  -  .0007378  (1  +  22"^  +  2*^) 
^  (1  -  1.2686  2  ^  +  0.7051  z 

-1  .  --2, 


H,(z) 


H,(z) 


(1  +  22*^  +  z^^) 


(1  -1.0106  z^  *  0.3583  2"'^) 


(1  +  22“^  +  2~^) 


(1  -  0.9044  ^  +  0.2155  2"*^) 


■=r. 


The  poles  In  the  z'^plare  of  these  equations  are  as  follows: 


Pj  -  0.8996  t  J.7804 
I  Pj  j  »  1.1909 
Pg  •  1.4103  ±  j.8956 
I  p2  I  -  1.6706 
P3  -  2.0984  ±  j.4870 
I  P3  I  -  2.1542 


(44a) 

(44b) 

(44c) 

(45a) 

(45b) 

(45c) 


Thus  all  three  of  the  factoring  second  order  transfer  functions  art 
stable. 

p  augmented  order  difference  equations  of  the  form  of  (16)  were  derived 
for  each  of  the  three  sections,  for  values  of  p  from  1  through  5.  and 
H2  were  unstable  for  p-1  and  stable  for  all  values  of  p  from  2  through  5. 
H3  was  stable  for  all  values  of  p  from  1  through  5. 

If  we  had  a  2  stage  adder  and  a  2  stage  multiplier,  (21)  would  yield  a  p 
of  4,  which  for  our  example  adds  stable  poles  to  each  section.  The  general 
structure  of  each  of  the  2nd  order  (4-augmented)  sections  would  be  as  shown  In 
figure  15. 


/ 
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Finally,  the  values  for  the  coefficient  of  each  of  the  multipliers  would 

be  as  shown  in  table  3. 

Ml 

Note  that  the  a'  ‘  coefficient  for  the  first  section  are  all  on  the 
order  of  .001  in  magnitude.  These  coefficient  could  easily  be  computed  to  more 
significant  figures,  although  realization  of  more  significant  figures  in  the 
coefficient  of  an  actual  filter  would  require  either  scaling  or  the  use  of  a 
floating  point  number  system. 

This  example  shows  how  an  HR  filter  can  be  realized  using  high  rate  pipe¬ 
line  modules  ,  with  the  ultimate  objective  of  achieving  a  higher  sample  rate 
than  possible  with  non-pipel ined  multiplier  and  adders. 
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SUWARY  AND  CONCLUSIONS 


In  this  paper  we  have  developed  a  method  for  applying  pipeline  techniques 
to  the  design  of  high  speed  recursive  digital  filters.  Using  these  techniques, 
recursive  digital  filters  operating  at  rates  hitherto  Impossible  can  be 
designed.  The  general  structure  of  the  filter  and  the  method  for  calculating 
the  multiplier  coefficients  Is  presented.  The  stability  of  the  resuTtfng  real¬ 
ization  has  been  Investigated  and  a  technique  for  ascertaining  the  stability  of 
the  realization  Is  presented. 

A  significant  example  taken  from  the  literature  Illustrates  the  technique 
and  demonstrates  the  existence  of  a  stable,  high  rate  pipeline  realization  of  a 
practically  useful  filter. 
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Figure  8 


Polynomifll*  F(2 )  *  2  »  0 

Conditions  for  Stability  *• 

l+b,>0 .  © 

t-b,>0 . . . ® 


2 

Polynomial*  F(2)»2^'<’b^2  +  {b|+b2^ 
Conditions  for  Stability* 


Figure  12.  Condition  for  stability  of  the  augmented  filter:  p*2 
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Conditions  for  Stability:  ^ 

l+b^+(b^+b2)+(b^+2b^b2)  >  0  .  W 

l-DT^(b^b2)-(b^2b,b2)  . . © 

. . ® 

l(b^2l.,b2)2-l|  >  Ibjib^.Zbjbjl-lb^^b^l..© 


Figure  14. 


(b>  I 

(a)  OrIgfnaT  transfer  function  (b)  Iqui valent  augmented  transfer 
function  1 
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